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Abstract: This work is concerned with the detection of a mixture distribution from a R- 
valucd sample. Given a sample Xi , . . . , X„ and an even density <p, our aim is to detect 
whether the sample distribution is </>(. — fi) for some unknown mean fj,, or is defined as a 
two-component mixture based on translations of (p. In a first time, a non-asymptotic testing 
procedure is proposed and we determine conditions under which the power of the test can 
be controlled. In a second time, the performances of our testing procedure are investigated 
in 'benchmark' asymptotic settings. A simulation study provides comparisons with classical 
procedures. 
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^ 1. Introduction 

'"^ In this paper, the detection problem of a mixture distribution from a M-valued sample is considered. 

I Let {Xi, . . . ,Xn) be i.i.d. random variables from an unknown distribution F. All along the paper, 

^ i^ is assumed to admit a density / w.r.t. the Lebesgue measure on M. The sample is said to be 

^^ distributed from a mixture when / belongs to 

(N 

g Ti^{xeR^il-e)(j)ix-tii)+e(j)ix-^i2); e €]0, 1[, (Aii,Ai2) e K', Mi < M2} (1) 

^^ where </>(.) denotes a density. In this paper, 0(.) is assumed to be an even known density, and when 

^D Gaussian mixtures are considered, (f>{.) — (f>G{-) with 

m 



</'G(a;) = ~7^^^P (^y ) '^^^ 



For a complete introduction about mixtures, we refer to McLachlan and Peel (2000). The two- 
^ component mixtures are often encountered in practice, for instance in biology and medecine. They 

allow to model situations where a population can be discrimined into two different groups. The first 
subpopulation is then assumed to be distributed following the density </>(. — mi) while the second 
one follows the density (j>{. — 112)- The probability that an observation Xi arises from the first (resp. 
the second) subpopulation is then modeled by 1 — e (resp. e). 

This model has been intensively studied and many pathes have been explored in order to provide 
a satisfying inference. In particular, the detection problem has attracted a lot of attention in the 
last two decades. The main goal is not to provide the best estimation of the parameters of interest 
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(e, fii, ^2) but rather to decide whether the incoming observations are following a mixture distribu- 
tion or not. In other words, one wants to detect if the sample of interest comes from a homogeneous 
or heterogeneous population. Let J^) be the density set defined as 

TQ = {xeR^(l){x- ^); ^ e M} . (2) 

Formally, one wants to test 

"/eJ-o" against "/e-^i"- (3) 

In various testing problems involving finite mixtures, the properties of the likelihood ratio test 
have been widely investigated. We can mention for instance Chernoff and Lander (1995), Dacunha- 
Castelle and Gassiat (1999), Azai's et al. (2009) or Garel (2007) among others. In all these papers, 
the main challenge is to determine the asymptotic behaviour of the likelihood ratio under the 
alternative hypothesis in order to investigate the power of the related test. Alternative methods 
have also been considered: modified likelihood ratio test by Chen et al. (2001), estimation of the 
L^ distance between density associated to null and alternative hypotheses by Charnigo and Sun 
(2004), EM approach in Chen and Li (2009) or tests based on the empirical characteristic function 
in Klar and Meintanis (2005). 

The main challenge related to the problem (3) is to find (optimal) conditions on {e, fj,i, fj,2) for 
which a prescribed second kind error can be achieved. The first study in this way is due to Ingstcr 
(1999) in the particular case where the mean fi under the null hypothesis is known, fii — fi in the 
alternatives and 0(.) corresponds to a Gaussian density. Similar results have also been obtained in 
Donoho and Jin (2004). In this last paper, the so-called Higher Criticism has been investigated. 
This algorithm is very powerful in the sense that it is easy to implement, and provides similar 
power than the usual likelihood ratio test. The asymptotic detection regions have been carefully 
investigated in two different asymptotic regimes: 

• the sparse regime where e ~ n^ and ^2 ~ l^-i ^ '\/2r log(n) with ^ < 5 < 1 and 

n— f+00 n— f+00 

< r < 1. In such a case, it is proved that the two hypotheses can be asymptotically separated 

if 

r > 5 - \ when \ < 5 <\ 

1 

r > (1 - Vl^^f when | < (5 < 1 



4 

n^r ii^HIt n ^' a ^ 1 on/^ i\ ^ <v ^ — 



• the dense regime where e ^ n and /i2 ^ /^i ^ n ^ with < 6 < ^ and < r < ^ 

In this framework, the separation is asymptotically possible if r < ^ ^ ^■ 

We refer for more details to Ingstcr (1999) and Donoho and Jin (2004). Jagcr and WcUncr (2007) 
proposed a family of tests based on the Renyi divergences which generalizes the procedure based 
on the Higher Criticism. 

We also mention that generalizations of this procedure to heteroscedastic mixtures have been 
proposed in Cai et al. (2011) while Cai et al. (2007) consider the problems of estimation and 
construction of confidence sets in sparse mixture models. Addario-Berry et al. (2010) determine non- 
asymptotic separation rates of testing for the contamination of a standard Gaussian vector in M" 
by non-zero mean components when the alternatives have particular combinatorial and geometric 
structures. More recently, Cai and Wu (2012) consider the detection of sparse mixtures in the 
situation where the density of the observations under the null hypothesis is fixed, but not necessarily 
Gaussian. 



Laurent et al. /Non-asymptotic detection of mixtures with unknown means 3 

In this paper, we consider a testing problem where the nuU hypothesis does not correspond to a 
fixed density but rather to the set of densities Fo defined by (2) which corresponds to a translation 
model. Thus the mean parameter /x under the null hypothesis is not assumed to be known. The 
considered alternative J-i corresponds to the set of densities that are mixtures of two densities 
of J-Q. Our aim is to decide whether the density / of the observations belongs to J"o or Fi. To 
this end, we introduce a new testing procedure based on the ordered statistics. Contrary to the 
Higher Criticism algorithm (Donoho and Jin, 2004), the main advantage of this procedure is that 
the knowledge of the mean ^ under Hq is not required. Since one can find densities in Fi that are 
arbitrary close to J-q, it is impossible to build a level-a test that achieves a prescribed power on the 
whole set Fi- Hence, we introduce subsets of J"i, denoted Fi[n, a, /3], over which our level-a test has 
a power greater than 1 — /3. The performances of our procedure are therefore non asymptotic. In the 
Gaussian case, this result is completed by a non-asymptotic lower bound, proving the optimality of 
our procedure (up to a logarithmic term). 

Then, the asymptotic performances of our testing procedure will be investigated considering 
Gaussian mixtures and the so-called sparse and dense regimes as Donoho and Jin (2004) and Cai 
et al. (2011). In particular, we will see that the detection regions are slightly different than in Cai 
et al. (2011), due to the fact that the mean /x under TJg is unknown. 

The paper is organized as follows. In Section 2, a testing procedure based on the ordered statistics 
is introduced. Then the non-asymptotic behaviour of this test is investigated and we propose a 
general separation set Fi[n, a, (S\. In Section 3, we provide non-asymptotic lower and upper bounds 
in a Gaussian mixture model that enhance the quasi-optimality of our separation set. An asymptotic 
study is proposed in Section 4. Some numerical simulations, providing a comparison with existing 
procedures are displayed in Section 5. Proofs are gathered in Section 6 and technical lemmas in 
Appendix. 

2. The testing procedure 

Recall that given an i.i.d. sample Xi, . . . ,Xn having a common density / w.r.t. the Lebesgue 
measure on M, our aim is to consider the testing problem Ho : f Cz Fq against Hi : f Cz Fi, namely 
to decide whether / corresponds to a given even density function cj) (up to a translation) or is 
defined as a two-component mixtures of translations of 0. 

In this context, one of the most popular testing procedure is the Higher Criticism introduced in 
Donoho and Jin (2004), whose asymptotic behaviour has been widely investigated (see also refer- 
ences above). Nevertheless, there exists up to our knowledge no description of the non-asymptotic 
performances of this algorithm. Moreover this procedure heavily depends on the knowledge of the 
mean under Hq. In this paper, we work in a slightly different framework in the sense that a trans- 
lation model under Hq is considered. 

In this section, a new testing procedure based on spacing of order statistics is proposed. The order 
statistics are denoted by A'(i) < ^(2) < • ■ • < -^(n)- The main underlying idea is that the spacing 
of these order statistics are free with respect to the mean under H^: for some k < I G {1, . . . ,n}, 
the mean value affects the spatial position of a given X/i^\, but not Xn\ — Xij^y Moreover, the 
distribution of the variables Xn\ — A^(fe) is known under Hq and has a different behavior under iJi, 
provided k and I are well-chosen. 

Let a e]0, 1[ be a fixed level. In the following, a level-a test function 1}}^ denotes a measurable 
function of (ATi, . . . , Xn) with value in {0, 1}, such that the null hypothesis is rejected if -^q = 1 
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and sup Vf[ipa = 1) < a. Assume that n > 2 and consider the subset /C„ of {1,2,..., n/2} defined 

as 

/C„ = {2^0<j<[log2(n/2)]}. 

Our test statistics is defined as 

*« := sup {lx(„_,+i,-X(fc)>g„„,J, (4) 

where, for all u g]0, 1[, q^^k is the (1 — u)-quantile of X(„_j,^x) ^ -^(k) under the null hypothesis and 
a„ = sup {u e]0, 1[, Vho (3fc e /C„, X(„_fe+i) - X(^k) > Qum) < a} . 

Note that since the distribution of X(„_j,^i) — X(j.) under the null hypothesis is independent of the 
mean value /x, qa„,k and a„ can be approximated (via Monte-Carlo simulations for instance) under 
the assumption that the Xi^s have common density (f). In the following we also provide explicit 
upper bounds for the quantiles, which can be used instead of the true Qa^k if necessary. 

By definition, the test statistics vPq is exactly of level a. We now want to evaluate the power of 
the test. More precisely, we want to define subsets of the alternative J^i over which the test has a 
prescribed power. We first need some definitions. 

Let $(x) = 1 — ^{x), where <& is the cumulative distribution function associated to the density 
function (p. For all a €]0, 1[ and k E {1, 2, . . . , n/2}, let ta,k be a positive real defined by 



$( ^-^\ = ^ 



'21og(^) 



(5) 



if fc > 21og(|;), and t^^k — +oo otherwise. For all a e]0, 1[, p > 0, and k G {1, 2, . . . , n/2}, we 
consider the subset S{a, p, k) of M^ defined by : 

{(£,^1,^2) e]0, l[xM^,/X2 > Mi; 3c e M such that : \ 

(l-£)f (t„,fc-c + £(A*2 -A*i)) + £*(<a,fe-c- (l-£)(Ai2-A*i)) > P \- (6) 
(1 - £)$ (C - £(/i2 - Ml)) + £$ (C + (1 - £)(m2 - Ml)) > P J 

When ta^k = +00, we use the convention S{a^ p,k) — $ for all p > 0. 

The following proposition highlights the non-asymptotic performances of the test ^a- 

Theorem 1. Let a g]0, 1[ and /3 e]0, 1 — a[. Consider the test ^^ described in (4). Consider the 
alternative sets 

Ti[n,a,/3] = < /(.) = (1 - e)(j){. - pi) + e(j){. - p2);ie, pi, P2) € [j S{a„,pik,n),k) \ 
I ke]c„ ) 

where S{am p{k,n),k) is defined by (6) with 

k 1 + VI + 2/C/3 



p{k,n) 



n nP 
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Then \E'q, is a level-a test and 

sup P/(*a=0)</3. 

In this theorem, we have defined a set ^i [n, a, /3] over which the level-a test statistics ^^ has a 
power greater than 1 — /3. This result holds for all n, it is non-asymptotic. The definition of the set 
S{a, p, k) is quite rough. Nevertheless, it will allow us to describe several situations for which the 
power of our testing procedure will be assessed, in both asymptotic and non-asymptotic cases. In 
the next section, explicit and sufficient conditions on (e, /ii, /12) are given, ensuring that the mixture 
density (1 — £)</'(. — /ii) + e(f>{. — fj.2) belongs to Ti[n,a,/3] when corresponds to the Gaussian 
density. 

3. A non-asymptotic framework 

The aim of this section is to provide sufficient and explicit conditions for which the triplet (e, /ii, (12) 
belongs to S{a,p,k). First, we introduce, for all p > and M > 0, the separation set J^i[/9, Af] 
defined as 

Ti[p, M] - {/(.) = (1 - e)0(. - p,) + e4>{. - M2), {e,Pi,P2) € S(p, M)} , 

where 

S{p, M) = {(e, /^i, P2) e]0, l[xM^ £(1 - £)(p2 - Pif >P, 0<H2-pi<M}. 

When the density of the standard normal distribution is considered {((> — (t>G)j the separation set is 
denoted J"i,g[p, M]. 

Remarks 1. In this setting, we assume that the difference between the means pi and p2 in the 
alternatives is bounded. This upper bound on p2 — Pi is necessary in the proof of Theorem 3 to get 
a uniform upper bound for the second kind error of the test over the set Ti^g{Pi M] for a suitable 
value of p. 

3.1. Lower hound for the detection of a Gaussian mixture model 

In this section, we consider the same definitions of non- asymptotic lower bounds for hypotheses 
testing problems than the ones introduced by Baraud (2002) for signal detection in a Gaussian 
regression model or Gaussian sequence model. We provide a non-asymptotic lower bound for our 
testing problem in the case where (j) corresponds to the standard Gaussian density. In particular, 
we exhibit values for p in 5(/9, M) for which the two hypotheses Hq and Hi cannot be separated 
with prescribed errors. 

Theorem 2. Let a €]0, 1[ and 13 €]0, 1 - a[. Let 



1 / l ~2\og[c{a,P)] log[c(a,/3)] 

C{M) i V n V 2n 
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with c(a, /3) = 1 - ^^~"~^^' and C(M) ^ ^\ + ^eA^'/4. Then for all p < p* , 

/3(J-i,g[p, a/]) := inf sup P/(^„ = 0) > ^, 

where the infimum is taken over all level-a test ip^- 

Theorem 2 implies that whatever the level-a test "00, if p < p*, there exists a density / G 
J^i^g[p, Af] for which P/(7/'ct = 0) > /3. In particular, testing is not possible if /i2 — Mi is too small 
with respect to e(l — e). We will show in the following that this condition on (e,/ii,/i2) is optimal 
(up to constant), namely if (e, pi, P2) G S{cp* , M), for some suitable constant c, then it is possible 
to construct a test and control the associated second kind error. 

3.2. A Testing procedure based on the variance 

In this paragraph, we are interested in a simple test based on the variance of the Xj's. We will 
prove that this test allows us to achieve the lower bound obtained in Theorem 2. 

Remark that under Hq, Yai{Xi) = 1 while under Hi, Vai{Xi) = 1 + £(1 — e){p2 — Mi)^- Hence, 
we consider the test ijja defined by 

1 " 
^" = 1{SJ>D„ „}, where Sl = ^{X, - X^f, (7) 

i=l 

and Va,n denotes the (1 — Q;)-quantile of the variable S^ under Hq. Then the following proposition 
holds. 

Proposition 1. Let a g]0, 1[ and /3 g]0, 1 — a[. Assume that the density function (j) has a finite 
fourth moment: K^[X'^] <B. Consider that f{.) = {1 — e)(l){. — pi) + e4>{. — P2) belongs to J-'ilp, M] 
with M > and 

p>C{a,P,M,B)/Vn, 

where C{a, (3, AI, B) is a positive constant; namely < p2 — Pi < M and e{l — s){p2 — PiY' > 
C{a,l3,M,B)/^. Then 

P/(^a = 0) < /3. 

In the Gaussian case, E^^ [X*] — 3. Hence, Proposition 1 assesses the optimality of the lower 
bound given in Theorem 2. Note that the value of p proposed in (8) differs from p* by constant. 
Finding optimal constant for our testing problem is a very difficult question that is out of the scope 
of this paper. For interested reader, we mention the work of Ingster (1999) in a slightly different 
(asymptotic) setting. 

3. 3. Upper bound for the testing procedure ^q, in the Gaussian case 

The goal of this section is to give explicit conditions on (e,pi,p2) that will ensure a prescribed 
power for the test ^^ defined in (4), when cj) is the standard Gaussian density. This will provide a 
better understanding of the behaviour of the set S{a, p, k) introduced in (6) in the Gaussian case. 
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Theorem 3. Let Xi, . . . ,X„ be i.i.d real random variables with common density f. Let a g]0, 1[ 
and consider the level-a test $„ defined by (4). Let (3 g]0, \ — a[ and M > 0. Assume that n fulfills 
n>2 and log(41og2(n)/a)/n < I>g(M)/36. 
Then, there exists a positive constant C(a, l3,M) depending only on a, /3 and M, such that if 



p>Cia,p,M)J'^'^''^, (8) 

V n 

J-"i^g[p, ill] C J^i[n,a, /3], which implies 

sup P/(^„ = 0) < /3. 
/e^i,G[p,M] 

Note that the value of p proposed in (8) differs from the lower bound p* by a term of order 
•y/loglogn. This log log term is due to the multiple (adaptive) testing procedure: the optimal value 
for k e K-n in the test ^q, is chosen from the data. Hence this -^/log log n term corresponds to the 
price to pay in such a setting. This kind of logarithmic loss is quite classical in test theory: see for 
instance Spokoiny (1996) or Fromont and Laurent (2006) in slightly different settings. 

The result given in Proposition 1 above seems even better than the one stated in Theorem 3 since 
the condition to have a powerful test is e(l — e)(/i2 — Mi)^ > G j ^fn instead of C-y/log log(n)/-y/n. 
Nevertheless, the test based on the variance would fail in the asymptotic sparse regime considered 
in the next section: this is not satisfying from a practical point of view since our aim is to provide 
a testing procedure which adapts to all possible situations. 

4. Asymptotic results 

The main conclusion of the previous (non-asymptotic) part, is that if e(l — e)(/i2 — Mi)^ is smaller 
than CoL,i3,M l\/n for a given constant Ca.j3,M and /i2 — Mi is bounded by M, then testing is impossible. 
In this part, an asymptotic point of view is adopted: we assume that n — > +oo and we would like 
to precise the allowed dependency of (e,/ii,/i2) with respect to n. In such a setting, the conclusion 
of Section 3 can be understood as follows: 

• if e = o(l/y^) as n -^ +oo, it is (at least) necessary that /12 ^ Mi ^^ +00 as n — > +00, so 
that the condition of Theorem 2 is not fulfilled, 

• if e >> l/\/n as n — >■ -|-cx), then we can allow fi2 — Mi to tend to 0. 

This justifies partially the frameworks considered in Donoho and Jin (2004), namely sparse and 
dense regimes. Indeed, we recall for the sake of convenience that the sparse regime is characterized 

by 

£ '^ n~^ and M2 — Mi ~ ^2rlog(ri) with - < 6 < 1 and < r < 1, (9) 

while the dense regime corresponds to situations where 

e ^ n~ and M2 ^ Mi ^ "■ '^ with < S < - and < r < -. (10) 

The notation a„ ~ b„ means that lim„_>.-|_oo cin/bn = 1- 

n— ^+00 

Below, we express conditions on 6 and r for which the second kind error of the test (4) can be 
controlled. 
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4.1. The dense case 

The results stated in Theorems 2 and 3 are non-asymptotic. Let us analyse the behaviour of the 
test ^a defined by (4) from an asymptotic point of view in the dense regime. 

Corollary 1. The detection boundary in the dense regime (10) is r*{S) — 4 — 2'' ^^^ detection is 
possible when r < r* (S) = ^ — ^ (for n large enough, the power of the test (4) is greater than l~ (3) 
and impossible if r > r*{S). 

The proof of Corollary 1 is omitted since it can be obviously deduced from Theorems 2 and 3. 

Results stated in Corollary 1 are therefore different from the one obtained in a dense regime in 
a contamination framework where Ho : f = (J)g{-) against Hi : f £ {(1 — e)(j)G{-) + £4>g{- — m); £ ^ 
]0, l[,/i G M.}. In this case, as mentioned in Introduction, the detection is possible in the dense 
regime for ?■ < | — (5 (see Ingster, 1999; Donoho and Jin, 2004). This difference is due to the fact 
that the mean under Hq is unknown, which makes the testing problem harder. 

4-2. Sparse case 

Using the same methodology, we can now analyse the performances of our testing procedure in the 
so-called asymptotic sparse regime. 

Theorem 4. Let Xi, . . . ,X„ be i.i.d real random variables with common density f. Let a e]0, 1[ 
and consider the level-a test "^a defined by (4). We consider the case where cf) = (J)q. 
We assume that the behaviour of (e,/ii,/i2) is governed by (9) and that r > r*{5) with 

{ 5-\ tfh<S<l 

r*iS)={ 

[ (1 - vi^r */ 1 < 5 < 1 

Then, setting /(.) = (1 — s)(f)G{- ^ f-i) + £</'g(- ^ 1^2), we have, for n large enough, 

P/(*a=0)</3. 

In the sparse regime, we recover exactly the separation boundaries that are already known in 
the case where the null hypothesis is reduced to a standard normal density, and the alternative is 
the mixture (1 — e)0G"(.) -t- £4>g{- ~ m)- Hence, the fact that the mean under Hq is unknown does 
not affect the difficulty of the related testing problem in this sparse regime. 

This proves the optimality of our procedure in the sparse regime. Indeed, the lower bounds 
established by Ingster (1999); Cai et al. (2011) in the case where the null hypothesis is reduced 
to the standard Gaussian density also provide lower bounds for our testing problem. This comes 
from the fact that a level-a test for our testing problem is also a level-a test for testing the null 
hypothesis "/ = 0g"- 

5. Simulation study 

In this section, we provide some numerical experiments in order to enhance the performances of our 
testing procedure ^q. Comparisons with the Higher Criticism and the Kolmogorov-Smirnov test 
are provided. Since these both procedures are not designed for the considered framework (translated 
model with unknown mean), straightforward modifications are proposed. 
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5. 1 . Contamination of 4>a 

In this section, the problem of detection of sparse heterogeneous mixtures as in Donoho and Jin 
(2004) is considered: Given (Xi, . . . , X„), i.i.d random variables with an unknown density function 
/, our aim is to test 

^0 : /(■) = 0g(-) against H^ : f e {x ^ {I ^ £)0G(a^) + eM^ -fi);fieR,e e]0, 1[}. (11) 

In this case, our testing procedure ^^ described in (4) can be easy adapted as follows: 

*a == sup {lX(„_,+ i,>g„„,J, 

where Qa,fc is the (1 — a)-quantilc of X/„_^^i\ under the nuh hypothesis, JCn = {2-^;0 < j < 
[log2(n/2)]} and 

a„ = sup{w e]0, 1[,Vh„ {3k e K-n-.X^n-k+i) > qu,k) < a}. 

For the sake of brevity, we do not exhibit a theoretical study of the performances of this procedure 
for the testing problem (11). Indeed, the methodology is rather close to the one proposed in this 
paper, up to some technical modifications. It is possible to see that this procedure achieves the 
optimal asymptotic separation set in both the dense and sparse regimes, as described in Donoho 
and Jin (2004). 

The power of our testing procedure is compared with the one of 

• Kolmogorov-Smirnov test: 

The level-a test function is tpKS.a = '^TKs>qKs a where 

Tks = supv^|F„(a;) - <^g{x)\ 

n 

with the empirical distribution function Fn{x) = - X) ^Xi<xi and qKS,a is the (1 — a) 

i=l '~ 

quantile of Tks under iJp . 

• Higher Criticism (Donoho and Jin, 2004): 

Let Pi = f'{Z > X,) where Z ^ A/'(0, 1) for alH e {1, . . . , n} and p(i) < p(2) < • • • < P(„). 
This test is based on 

HC=max^^^li^M. 

i^*^"\/pw(i-pco) 

The level-a test function is '4'HC,a = ^HOqnc » where qHC,a is the (1 — a) quantile of HC 
under Hq. 

In order to study the power of these testing procedures, a Monte-Carlo procedure is considered 
with N — 100000 samples of size n = 100 from a mixture distribution (1 — e)^G(.) -|-e^G(. — /x) with 
e e {0.05,0.15,0.25,0.35,0.45} and fi e [0,10]. The power functions of these testing procedures in 
the different scenarios are reported in Figure 1. 

It appears that our procedure performs as well as the Higher Criticism when e is small w.r.t. the 
size of the sample, while the Kolmogorov-Smirnov test possesses a bad behavior. Such a setting is 
close to the sparse regime. Nevertheless, the performances of the Higher Criticism deteriorates as e 
increases while the power of our test ^^ remains stable. 
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0.0 0.5 1.0 1.5 2.0 2.5 3.0 





Fig 1. Power function of the three considered testing procedures (continuous line for our test ^a, dashed line for 
Higher Criticism and dotted line for the Kolmogorov-Smirnov test) according to fi, for e = 0.05 (top-left), 0.15 (top 
right), 0.25 (middle left), 0.35 (middle right) and 0.45 (bottom left) in a contamination framework. 



Laurent et al. /Non-asymptotic detection of mixtures with unknown means 



11 




0.0 



~~i 1 1 1 1 r~ 

0.5 1.0 1.5 2.0 2.5 3.0 






Fig 2. Power function of the three considered testing procedures (continuous line for our test ^a, dashed line for 
Higher Criticism and dotted line for the Kolmogorov-Smirnov test) according to fi, for e = 0.05 (top-left), 0.15 (top 
right), 0.25 (middle left), 0.35 (middle right) and 0.45 (bottom left) in Gaussian mixture framework. 
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Fig 3. Power function of the three considered testing procedures (continuous line for our test ^a, dashed line for 
Higher Criticism and dotted line for the Kolmogorov-Smirnov test) according to fi, for e = 0.05 (top-left), 0.15 (top 
right), 0.25 (middle left), 0.35 (middle right) and 0.45 (bottom left) in Laplace mixture framework. 
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5.2. Gaussian mixtures with unknown means 

In this section, we deal with our testing problem. A simulation study is proposed in order to inves- 
tigate the power of our testing procedure "^ a described by (4). Our testing procedure is compared 
with the following adaptations of Kolmogorov-Smirnov test and Higher Criticism: 

• Kolmogorov-Smirnov test: 

The level-a test function is ^r-s « — 1^^ -^ - where 

Tks = supV^|F„(a;) ~<^g{x~X)\ 

_ n 

with the empirical mean X, the empirical distribution function Fn{x) = - X) ^Xi<x: and 

QKS,a is the (1 — a) quantile of TVs under Hq. 

• Higher Criticism (Donoho and Jin, 2004): 

Let Pi = P{Z-X > Xi) where Z ~ A/'(0, 1) for alH G {1, . . . , n} andp(i) < p(2) < . . . < P(„). 
This test is based on 

The level-a test function is ijJHC,a — 1hv5>- where qHC,a is the (1 — a) quantile of HC 

under Hq. 

In order to study the power of these testing procedures, a Monte-Carlo procedure is considered 
with N = 100000 samples of size n — 100 from a mixture distribution {l~e)(j>G{-) +£0g(-"M) with 
e £ {0.05,0.15,0.25,0.35,0.45} and /i S [0,10]. The power functions of these testing procedures in 
the different scenarios are reported in Figure 2. 

Once again, our testing procedure appears to be competitive w.r.t. the existing procedures, and 
even offers better performances in some particular cases. As in the previous experiment, the behavior 
of the Higher Criticism deteriorates w.r.t. our procedure as e increases, namely when we leave the 
sparse regime to the dense one. 

5.3. Laplace mixtures with unknown means 

Since our test ^Pq, is adapted for an even density function 0, a Laplace distribution is here considered: 
0(x) = I exp(— |a;|). As in Section 5.2, the power of V&q is compared with the one of Kolmogorov- 
Smirnov test and Higher Criticism. Note that these two last tests are adapted as in Section 5.2 but 
where $ and Z are now associated to the Laplace distribution. A Monte-Carlo procedure is proposed 
with N — 100000 samples of size n = 100 from a mixture distribution (1 — e)0(.) -I- e(j){. — fi) with 
e £ {0.05,0.15,0.25,0.35,0.45} and fi £ [0,10]. The power functions of these testing procedures 
in the different scenarios are reported in Figure 3. Apart in the case where e — 0.05, our test 
outperforms Higher Criticism and Kolmogorov-Smirnov in all other conditions. The power of Higher 
Criticism is deteriorated as e increases. 
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6. Proofs 

6.1. Proof of Theorem 1 

Proof. Following the definition of a„, ^^ is ensured to be a level-a test. 

In order to control the second kind error of the test ^q,, an upper bound for qa^.k is first given. 
Under the null hypothesis, there exists ji ^M. such that /(.) — (f>{. — pt). Thus X/^n-k+i) — ^(k) is 
distributed as Y(^ri-k+i) ~ ^(fc) where (Yi, . . . , y„) is a n sample from the density (/i(.). Hence, if we 
find CQ„,fc such that P(y(„-fe+i) - y(fc) > Ca^.k) < «« then Qq,.,*; < Ca,^,k- For all d g M, 

P(>"(n-fe+l) - Y(^k) > Ca,,,k) < nY{n-k+l) > Ca„,fc + d) + P{Y(^k) < d) . 

According to Lemma 1, if d fulfills $(d) < - 



2iog(s!r) 



the same lemma, if Ca,^^k is chosen such that ^{ca,^,k + d) < - 
'^a,^,k + d) < ^. Choosing d and Ca^.k exactly such that 



then P(Y(fc) <d) < ^. Moreover, by 
then P(Y(„_fc+i) > 



k 



$(d) = l'(c„„,fe+(i) 



'21og(^) 



and since </>(.) is an even continuous function, we obtain that d 

k 



Ca„,k 



Ca„M = ta^,k where <^{^^) 

thus qa„,k < ta^.k- 



1- 



21°g(4^) 
k 



2 . Finally, choosing 

PHo{X{n-k+i) - X(k) > ta^,k) < an and 



Considering / e J^i[n, a, /3], we want to control the second kind error of the test: 

P/(*„ - 0) = P/ (VA: e /C„,X(„_fc+i) - X(fe) < 9„„.fc) 
< inf P/ (X(„_fc+i) - X(fc) < 9a„,fc) • 



(12) 



Since / e J^i[n, a, /3], there exist e e]0, 1[ and (^i, ^2) G K^, ^1 < 112 such that 

Va; e M, /(x) = (1 - e)0(x - ^i) + e(j){x - ^2) 
and for some k G /C„, there exists a real c such that (e,/ii,/i2) fulfills the two following conditions: 
(l-e)l>(ta„,fc-c + e(^2-Mi))+£*(ia„,fc-c- (l-e)(Ai2-Aii)) > p(fc,"), (13) 



(l-e)$(c-e(^2-Mi))+£*(c+(l-£)(M2-Mi)) > p{k,n), (14) 



with /9(/s, n) 



J „ . Using (12) and the fact that qa^^k < ta„,k, 



k I l + N/l + 2fcff 



< 
< 



P/(^(„-fe+l) < ia„.fc + E/[Xi] - c) 
+P/(X(fc)>E/[Xi]-c). 



(15) 
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For the first term in tlic riglit-liand side of (15), 



15 



^ ^f\Yl {l{^.<t=„,fc+IE/[Xi]-c} - <?i} > n(l - qi) - k 



\i=l 



ith 



= (1 - e)$ (t„„,fc + E/[Xi] - c - Ail) + e$ (ia„,fc + E/[Xi] -0-^2) 
= (1 - e)$ (t„„,fe - c + e(/i2 - Ml)) + f^* (ia„,fc - c - (1 - £)(/i2 - A*i)) 

since Ey[Xi] = (1 — e)^^ + e/x2. Condition (13) gives that n(l — gi) — fc > and using Markov's 
inequality, 



Note tliat the inequality 



, "\^2 < 2 is fulfilled if and only if x 



n ^ i3n 



± 



Vl+2kp 
I3n 



Then, 



since Condition (13) ensures us that I — qi 



fe I J_ I v/l+2fc/3 
n n/3 n/3 



^/ (^(„-fe+l) < ia„,fc + E/[^l] - C) < 



/3 



For the second term in the right-hand side of (15), 

/ " > 

Vf (X(fe) > E/[Xi] -c)<¥f[Y, {l{X,>E,[X,]-c} -q2}>n{l- q^) - k 

\i=i J 

with 

92 = P/(Xi >E/[Xi]-c) 

= (1 - e)* (E/[Xi] - c - Ml) + e$ (E/[Xi] -0-^2) 

= (1 - £)!> (-C + e(M2 - Ml)) + £^ (-C - (1 - £)(M2 - Ml)) 

= (1 - £)$ (c - £(m2 - Ml)) + e$ (c + (1 - e)(M2 - Mi)) ■ 

Condition (14) gives that n(l — 92) — fc > and using Markov's inequality, 



-; (X(fe) > E/[Xi] - c) < . "(^ .'^^^,.^ . 
^ ^ ' ^ \n(\ — q2) — k\^ 



According to Condition (14), 1 — 92 ^ 



n ^ nl3 



± 



Vl+2kt3 
nP 



thus 



-f{X^k)>Ef[Xi]-c)< 



Finally, P/(*a = 0) < /?. 



n 
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6.2. Proof of Theorem 2 

Let %pa be a level-a test. For all / e J-i.gIPi Af]i 

P/(V'a = 0) = P0(^a = 0) + P/(l^„ = 0) - P0(^„ = 0) 

Thus for a density / e Fi^g[Pi ^i] which has to be specified after, 

sup P/(^a=0) > l-a-[P^(7/^„=0)-PKV'„-0)] 



16 



> l-a 



\TV 



where \\P — Q\\tv denotes the total variation distance between two probability distributions P and 



Q. Since |lP^-P^-||Ty < ^ 2[1 - A{(j), /)"] where A{(j), /) = j^ sj (f){x) f {x)dx is the Hellinger affinity 
between the two density functions (p ^-^d /, 



/3(J-i.G[p,Af]):=inf sup P/(V'„ = 0) > 1 - a - v/2[l - A((/., /)"]. 

If we specify a density / e Fi^g[Pi ^^ such that A{(f), /) > c(a, /3) " then /3(J^i^g[p, Af]) > 1 — a ■ 
(1 — a — /?) = /?. And, since 



^(<^,/)> 1-2^0 



V(X)-0(X) Y 



A{4>, /) > c(q:, /3) " is obtained if E^ 



( f(X)-4,(X) \ 

\ <P{x) ) 



< 2 



l-c{a,py^ 



In the sequel, we consider the density / = (1 — £)</>(. — pi) + £</>(. — ^^2), with 

(1 - e)^j.i = -e/i2 



max{fil,fil,\fiifi2\) <v^ = 



AP 



In particular, / e J'l^G'ip, M] since (/i2 — A*i)^ < A#^. 



(16) 

(17) 
(18) 



For this density choice, 



Erf 



' fiX)~<PiX) \ 
HX) 



<j>ix) 



dx 



{(1 - emx - Ml) - cl){x)] + e[cl){x - pt2) - H^W 



0(x) 



dx 



{l-ef 
+2e(l - e) 



dx — 1 



<P{x) 

}{X - fli)(f>{x - fl2) 

cPix) 



<P{x) 



dx — 1 



dx — 1 
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Considering the Gaussian density (</>(.) — cjjai-)), we have J^ 0G(a: ^^^^f) '^^ dx = exp(^i/i2), 



Ew 



V'(x)-fe(x) Y 



(l-e) 



e'^i -1 



e^2 _i 



+ 2e(l - e) [e''!^^ - 1] , 



Next, using that |e" - 1 - u - \u^\ < ^\u\^ for all |u| < U with Condition (17), 



Erf 



f{X) - (bajX) 
MX) 



< (1-e) 



+£" 



M? + 2^^! + ^M? 



1 



2 -1- 4 G 6 

M2 + 2^^2 + -^^2 



+ 2£(l-£) 



1 e'^ 

MiA*2 + 2A*iAi2 + -7^\^^l^J'2\^ 



+ ^[(l-£)lA^i|'+e|M2|']'. 
The parameters of / are constrained such that (1 — e)/ii + e/i2 — thus 

21 



E. 



( f{x) ~ cf>G{xy 

[ MX) 



< - [(1 - e)£(/i2 - m)']' + V {(1 - s)^l/^2 - Mi|'[£' + (1 - ef ]}' 



1 p^ 

2 + ^(M2-m) 



< (i-£)V(M2-Mir 

< C2(M) [(1 - e)£(M2 - Ml)"]' = C\M)p' 
with C2(M) = 5 + iAf2e^'^'/4. Moreover, if u < 0, 1 - e" > -u - ^u^ thus 1 - c(a,/3)- > 



i logc(a,/3) - i ('l^gfi^i^') . Then, the condition 



p=(l-£)e(Ai2-/ii)^< 



C(Af) V n 



2 ^ log c(a,/3) \ 
-logc(a,/3)- :=p 



implies that /3(J"i,g[p, M]) > (3. 

6.3. Proof of Proposition 1 

Following the definition of the threshold Wq.„, it is easy to see that tpa defined in (7) is a level-a 
test. Now, our aim is to upper bound the term 



^/(^a = 0)=P/(52<T;„^„) 
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when / e J"i [p, M] . 

In a first time, a control of Wq_„ is required. If a real Ca,n is determined such that Pijp(iS'^ > 
Ca,n) < a, then Uq,_„ < Cq^„. According to Wilks (1962, page 200), if Yi, . . . ,y„ are i.i.d. random 
variables such that E[(Yi - E[Yi])^] < +oo, then 



va^ [-^ E(^' - ^") J ^ ^ {^[(^1 - i^t^i]) 



41 !^var(yi)2 
n — 1 



Hence, since E0[X]'] < B, 



"HoiSl > C„.„) = Pffo(^n - 1 > Ca,„ " 1) 



In particular P//(,(S'^ > Cq^„) < a with Cq,_„ = 1 



< 



Var(^2) 



< 



B 



, and thus 



(19) 



Va,n < 1 

Please note that Ef[Sl] = Var(Xi) = 1 + e(l - e)(^2 - Mi)^- Hence, for all / e J"i[p, M], 
VCV-a =0) < ¥f\S-'^<l+ ' ^ 



-f{sl~E[Sl]<l 



-E[S, 



I] , 



if e(l-e)(/i2 -Mi)^ > 



< pJ|5f,-E[5fJ|>£(l-e)(M2-Mi)' 
^ Var(g,^J 

Using Equation (19), we get 

'/(V'a = 0) < ■ 



E/[(Xi-E[Xi])4] 



e(l~e)(Ai2-/ii)^ 
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In order to conclude, just remark that 
E/[(Xi - E[Xi])4] = (1 - e) / [x - (1 - e)^ll - e[i2fi>{x - fii)dx 

JR 

+e [x - {I - e)ni - e/i2]^(/>(a; - ii2)dx 

Jr 

= (1 - e) / b - £(M2 - fii)]'^(j){y)dy 

Jr 

+£ / [y + (1 - e){fi2 - pti)]U{y)dy 



Thus 



as soon as 



= E^[Z4] + 6£(1 - £)(m2 - Aii)'E^[z2] + [e(l - e)* + ^4(1 _ ^^^^^^ _ ^^)4 

< B + -VbM^ + M'^ < (A/2 + Vb)^. 
- 4 - ^ 






2^ C{a,f3,M,B) 



£(l-e)(Ai2-Mi) > 
for some positive constant C(a, /3, M, B). This concludes the proof of Proposition 1. 

6.4- Proof of Theorem 3 

Proof. Let /(.) = (1 — £)0g(- — Mi)+^0g(- — ^2)- We will prove that if (8) holds then / G Tx[n,a,l3] 
and the result will be a consequence of Theorem 1. Let j £ N* such that 2~^ < $g(M) < 2~^^~^\ 
In the following, we consider k £ /C„ such that 27tt < k < ■^. Note that, under the assumptions 
of Theorem 3, n > 2/$g(M) and n/2^ > 1. Note also that t|/C„ < log2(7T.), hence a„ > a/\jtJCn > 
a/log2(r7,). We will show that (6,^11,^2) £ S{an, p{k,n),k): Considering c — "2"'' ^^'^ denoting 
r = /i2 — /ii, we want to prove that 



(l-£)#G(^+erj+£$G(^^-(l-e)T) > p{k,n) (20) 

(l-£)$G(^-er)+e$G(^ + (l-e)r) > p{k,n) (21) 

hold, with p(fc,n) = | + ;ij + ^^ifM. 

We use a Taylor expansion at the order 2, the terms of order 1 vanish and this leads to : 

(1 _ e)^a ( V + "") + '^-^ (V -^^- ")" 

= $G (^) + ^(1 - £)er2 [e(-0'G(a)) + (1 - e){-cj^Gim 
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where a (resp. b) belongs to the interval 

We recall that $g (%^) = ^ 1 - \/ 
Moreover, it is easy to check that 



tari.k tan. 

2 ' 2 



■ ET 



(resp. 



21og(4/a„) 
k 



SinceA;<f,$G(%^) 



-(l-£)r,Vf) 

k 



> 



log(4/a„) 
23-in ■ 



1 



p{k,n) < 1 , 

Hence, in order to prove that (20) holds, we just have to show that 

'log(4/a„) , 4 



(1 _ e)eT' {e[~<P'Gia)] + (1 - e)h</.^(6)]} > 



23-^n nl3 V 2^-3^/3 



1 



(22) 



Next, we want to prove that °^''° — (1 — £)t, 
[ci(M),C2(M)] with ci(Af) > 0. First, 



ET 



remains included in a fixed interval 



^ - (1 - e)r > ^ - T > $5' (2-^) - M := ci(M) > 



since $g (%^) < ^ < 2"-' and <^g{M) > 2"^. Second, 



$f 



ta„.k 



k 

n 



k / 21og(4/a„) 
n V n 



> 



2J+1 



2J- 



1 /log(41og2(n)/a) 



> -$g(M) - \/2$g(M) 



$g(M) 



36 



> ^g{M) 



Thus 



T < $, 



,1 ($g(A/) 



+ A/ :=C2(M) 



Finally the function —(f>Q is bounded from below on this interval by some positive constant C{M) 



iin:re[ci(Af),c2(M)](-0G(a;))- This implies that (22) is satisfied if e(l-e)r2 > C(a, /3,M) ^'"^^^"^ 
for some suitable constant C(q;,/3,M). This concludes the proof of (20). The proof of (21) follows 
the same arguments. D 



6.5. Proof of Theorem 4 

Proof. We will prove that, under the assumptions of Theorem 4, / G J^i [n, a, /3] and the result will 
be a consequence of Theorem 1. We recall that tt/C„ < log2(n), hence a > a„ > Q;/tt/C„ > a/log2(n). 
We set T = fj,2 — fii and we have to prove that there exists k £ /C„ and c G K such that 



(l-e)$G(ia„,fc-c + er)+e$G(ia„,fc-c- (l-e)r) > pik,n) 
(l-e)$G(c-er)+e$G(c+(l-e)r) > p{k,n), 



(23) 
(24) 
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with p(fc, n) = ^ + ;^ + ^^nT'^ ■ Note that p(fc, n)<^+Cfi^ with Cp^^^ y|. We recall that 
ta„,k is defined by 

k 



$r 



21og(4/Q„) 
k 



In the following, we set Ca„ = •\/21og(4/Q;„). Since q;„ > a/ log2 (n) , note that < C^^ < 

(25) 



C(a)-\/loglog(n) for some constant C{a) depending only on a. We choose k G /C„ such that 

k 



n-y+oo log(n) log log(n) 



and we define 



in 



-cx) and lini — 

ri— i.+oo k 



L'a„ 



-CX) 



2 V fc 

For the sake of simplicity, we omit te dependency with respect to n in the notation of k and c. 
Let us first show that (24) holds for n large enough. First note that 

(1 - e)l>G (c - er) + e$G (c + (1 - e)T) > (1 - e)^G{c). 

With the assumptions on fc, we have that c > for n large enough since ta^^k ~^ +oo and C^^ /\k 
as n — 7> +00. Hence 

c^Ti ^k 



(26) 



*g(c) > $G 



-a,>G ^ 



Moreover, for all u > 0, 



hence 



*g(u) < -exp(-uV2) = W^</)g(u), 



¥ '- i>' ¥ 



This leads to 

(1 - e)$G(c) > (1 - e) (^1 + ^) $G ("^ 

After some obvious computations. Condition (24) is satisfied as soon as 



(1 - £)Ca„ 



2 ,\Vk k Vk 2Cl^ 



Since e < ^j \/n and k < n, we have e/c < -\/fc. We recall that Cq^ — > +oo as n — > +oo and with 
the assumptions on fc, we have that ykjCa^ -^ +oo as n — > +oo, and the above inequality holds 
for n large enough. 

It remains to prove that (23) is satisfied with the conditions on fc imposed by (25) and the value 
of c defined by (26). Let A satisfy 0<r<A<l, we choose fc e /C„ satisfying (25) and such that 
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v} ^ < k < 2n^ '^ log (n). Note that such values of k exist for n large enough. It follows from 
Lemma 2 that ta^^k/2 < ■y/2Alog(n). Fhst, 



^G{ta„,k-C+eT) = $ 



G 



> $ 



G 



^Q,i,fc 



> 



Vk 



-Ca„ + ST 



^^-+^0^4V 



^C.„+£r)0G(Y 



We have to give an upper bound for 0g ( -s^ j . We use the inequality 



this leads to 



Vm > 0, $g(m) > ( - - 4 ) (t>G{u), 



Vu > 0, (i>G{u) < -1, r$G(u) < U^'^Giu), 

u^ — 1 



provided that u^ — 1 > 1. This is the case, for n large enough for u — ta^.k/'^, hence we have 

3 



.Gl^^^l < 



< 



OLji.k 

2 



$ 



G 



V2Alog(n) 



3 k 



< 472[log(n)]^/ n 



7/2 ^-A 



Finally, we obtain that 



- , ^ k ^ Vk V2Ca 

n n \ Jk 



er j4V2[log(n)]^/^n-^. 



Second, we want to lower bound ^q {t^^^k — c — (1 — e)T). We have that 
^G(^a„,fc-c-(l-e)r) 



*= ¥WK-<'-'^ 



Cq„ + er 



> $G I \/2Alog(n) - T - 

> I'G (V2Alog(n) - V2rlog(n)) - I er + ^ "" j ,/>G(V2Alog(n) - V2rlog(n)) 



since r = •\/2r log(n). Moreover, since ^G('\/2Alog(n) — •\/2r log(n)) = (\/27r) ^n (^'^ ^' , and 
using again the inequality $g(w) ^ (- t)'^g('") which holds for all u > 0, we obtain that 



^G {to.„,k - c - (1 - e)r) > Cn-(^-^)' 



V'logM 



\^2C^ 
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for some positive constant C depending on A and r. Condition (23) is thus fulfilled if 



Cen 



-(VA-yF)" 



1 



V'logM 



£T 



kin 11 \ 



Ca 



eTJ4%/2[log( 



23 



n]\ n 



By (25), Ca^/Vk — o(l/-\/log(n)), and the left hand side of this inequality is equivalent as 
n — > +00 to Cen^'^^^^^^ / ^y\og{n) and the right hand side is equivalent as n -^ +oo to 
SCa,^ (log(n)) ' n~'^ /\/k. Hence, the condition (23) will be satisfied asymptotically if for some 

Ae]0,l], 

• If i < (5 < I and < r < | , we set A = 4r and the above condition becomes r > S — ^. 

• If ^ < i5 < I and r > j, the above condition is satisfied with A = 1 and no additional 
condition is required. 

3 



• If (5 > |, we set A = 1 and the above condition becomes r > (1 — \/l — S^ 
This concludes the proof of Theorem 4. 



a 



Appendix A: Lemmas for upper-bound 

Lemma 1. Let Yi, . . . ,Yn be n random variables with a cumulative distribution function F and the 
order statistics are denoted Y(i) < Y(2) < • • • , ^(n) • Let a e]0, 1[ and let k e {0, . . . , n — 1} such that 
k > 2 log ( — )• Let c and d be two reals such that 



k 



'21og(|) 
k 



F(d) V(l-F(c)) < 

Then P(Y(„_fc+i) > c) < a and V{Y(^k) < d) < a. 
Proof. 

ny{n-k+i)>c) - p(ilim>c}>fcj 

= P ( X^{l{y,>c} - [1 - F{c)]} >k-n[l- F{c 
According to Condition (27), 



/21og(^) 
k-n[l- F(c)] > fcW ^^^ > 0. 



Using a Bernstein's inequality, we get 

P(Y(„_fc+i)>c)<2exp 



1 {k-n[l-F{c)]f 
'2v + \{k-n[l~F{c)]) 



(27) 
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withi; = Er=iE[(l{y,>c}-[l-^(c)])'] = Elli Var (ly,>c) = nF {c)[l - F {c)] < n[l-F{c)]. Thus, 
3v + k-n[l~ F{c)] < 2n[l - F{c)] + k < 3k - 2kJ^^°^ < 3k. This huphes that 



P(y(„-fc+i)>c)<2exp 
In the same way, 



3(fc-n[l-F(c)])2"l 
2 



3fc 



< 2exp 



log 



= P (E{i{i'.>rf} - [1 - ^(^)]} < ^P(d) - k] . 

Since nF{d) — k <0 according to Condition (27), a Bernstein's inequality implies that 



nYik) <d)<V 



^{l{y,>4-[l-F(d)]} 



> k~nF{d) < 2exp 



1 [nF(d) - kY 



2v+ \[k-nF{d)] 



with V = Er=iE[(l{y.>d} - [1 - i^(d)])'] = ELi Var(r, > d) = nF(d)[l - F{d)] < nF{d). Thus, 
3w + /c - nF{d) < 2nF{d) + k < 3k - 2kJ ^^°f^^ < 3k. This implies that 



TP{Y(k) <d) <2 exp 



2 3k 



< 2 exp 



a 



n 



Lemma 2. //fc > 81og(4/a„) and ^ > n"'^ wzf/i A g]0, 1[, f/ien 

t„„,fc < 2v/2Alog(n). 
Proo/. 



$r 



- r"p 



21og(4/a„ 



1 M"„:* 



2 V 2 



thus 



exp 



1 lta„,k 



2 V 2 



< 



^_ / 21og(4/a„ 



H -1 



If fc > 81og(4/a„), then 



21og(4/a„) 
fc 



> 1 



which leads to ia„,fc < 2A/2Alog(n). 



n 
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